Which of the following are valid scoring methods for attention?
Traveling salesman
Dot product
What's the intuition behind using the dot product as a scoring method?
The usefulness of the commutative property of multiplication
The dot product of two vectors in word-embedding space is a measure of similarity between them
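The similarity intuition can be illustrated with a small sketch. It assumes toy 3-dimensional vectors and the hypothetical names `query` and `candidates`; none of these come from the quiz itself, and real attention layers usually score learned hidden states rather than hand-written lists. Each candidate is scored by its dot product with the query, and a softmax turns the scores into weights.

```python
import math

def dot(u, v):
    # Sum of element-wise products; larger when the vectors point the same way.
    return sum(a * b for a, b in zip(u, v))

def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical toy vectors, chosen only for illustration.
query = [1.0, 0.0, 1.0]
candidates = [
    [1.0, 0.0, 1.0],  # identical to the query
    [0.0, 1.0, 0.0],  # orthogonal to the query
    [0.5, 0.5, 0.5],  # partially aligned with the query
]

scores = [dot(query, c) for c in candidates]  # [2.0, 0.0, 1.0]
weights = softmax(scores)                     # non-negative, sums to 1

print(scores)
print(weights)
```

The candidate that matches the query gets the highest score and therefore the largest weight, while the orthogonal one scores zero and gets the smallest weight.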